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Abstract 

The concept of temperature is one of the key ideas in describing the thermodynamical properties of a physical 
system. In classical statistical mechanics of ideal gases, the notion of temperature can be described in two different 
ways, the kinetic temperature and the thermodynamic temperature. For the Boltzmann distribution, the two notions 
lead to the same result. However, for a general probability density function, while the kinetic temperature has been 
commonly used, there appears to be no corresponding general definition of thermodynamic temperature. In this 
paper, we propose such a definition and show that it is connected to the Fisher information associated with the 
distribution of the momenta. 
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I. Introduction 

The inverse of the thermodynamic temperature of a system in equilibrium can be defined as the rate 
of change of entropy with energy [1]. It can also be thought of as a measure of the kinetic energy of the 
particles composing the system. However, for a system that is not in equilibrium, while it is possible to 
define a temperature using the kinetic energy of the system, there appears to be no commonly accepted 
notion of thermodynamic temperature. 

In this paper, we introduce a definition of thermodynamic temperature for any probability density 
function (PDF) on the momentum of the particles. Our main contribution in this paper is to consider a 
particular form of perturbation of the momentum that increases the entropy and the energy associated 
with the distribution. This perturbation can be thought of a statistical realization of "heating". Then, we 
define the thermodynamic temperature as the rate of change of entropy with energy for this particular 
form of perturbation. 

We first consider the case when the momentum is a continuous random variable in Section [Till. For 
the continuous case, by using the de Bruijn identity [2, 3] which provides a relationship between rate 
of change of entropy with a parameter in the perturbation and the Fisher information, we show that the 
thermodynamic temperature is the Fisher information associated with the probability density function of 
the momentum, i.e., and, hence, does not depend on form of the perturbation. The main result of this paper 
is that when the momenta are a vector valued random variable P with 2>N components, the thermodynamic 
temperature 6 is given by 

1 dS 2mk ^ f°° , ^ ( d x 1 



E/ /^b ln ^ d y- (1) 



dS 3N Z-.J ^^^ygy. 

Our definition of thermodynamic temperature coincides with the conventional definition of temperature 
when the distribution is the Boltzmann distribution (steady-state distribution) but is more general since it 
is applicable to any distribution on the momentum. 



In Section IV we consider the case when the momentum is a discrete random variable and propose 
a particular form of perturbation that can be used to define the thermodynamic temperature. For this 
case also, we establish a relationship between the thermodynamic temperature and a statistical quantity 
associated with the distribution, which can be thought of as the discrete counterpart of Fisher information. 



II. Classical Definitions of Temperature 

Throughout the paper, we use the following notation: vector valued random variables are represented 
by capital letters with an arrow such as P and their realizations are denoted by lowercase letters with an 
arrow such as y. The PDF of the random variable P evaluated at y is denoted by fp(y). Scalar random 
variables are denoted by capital letters and their realizations by lower case letters. Quantities associated 
with a random variable which are only functions of the PDF are denoted as functionals of the PDF instead 
of functions of the random variable. For example, the entropy of P is denoted by S(fp) instead of S(P). 
All logarithms considered are natural logarithms. 
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Consider a system with N particles and let the states be represented by the random variables [X, P], 
where P E R 3N denotes the momenta and X e R 3N denotes the positions of the particles. Let fgp(x,y) 
be the joint probability density function of the position, fg(x) and fp(y) are the marginal PDF's of the 
position and momentum, i.e., 

fx(%) = J fgp&if) d V, fp(v) = J fxp(XiV) d %- 

3N 3N 

In the above equation dy = Y\ d V% an d dx = Y\ d x i- 

i=l i=l 

For an equilibrium distribution, we have the classical Boltzmann formula that 



T 



1 2 



where 4p- + v(x) is the Hamiltonian of the system and T is the temperature of the system. Hence, the 
marginal fp(y) is a Gaussian distribution with zero mean and variance 2mS(fp), where £(fp) is the 
kinetic energy of the system given by 

E[\\P\f] = ± 
2m 2m 

The thermal entropy of the distribution S, is given by 



W?) = - J1 t A1 = —„ I \\y\\ 2 fp(v)dy- (2) 



S(fp) = -kJ fp(y) Hfp(y)) dy, (3) 

where k is the Boltzmann constant. At equilibrium, the temperature of the system is related to its kinetic 
energy through the standard relationship 

(3/2)NkT = S(fp). (4) 

For future reference, we will call T as the kinetic temperature. Notice that T is proportional to the variance 
of the momentum and, hence, the kinetic energy of the system. 

From a thermodynamic point of view, for a system in equilibrium, an alternate definition of temperature 
can be obtained through the fundamental equation of state [1], from which we get 



1 _ dS(fp) 



(5) 



9 d£{f P Y 
where 9 is the thermodynamic temperature. 

For a system in equilibrium, it well known that the kinetic temperature defined in ([4]) is identical to the 
thermodynamic temperature defined in ([5]), i.e. 9 = T. Now, the question arises whether one can define 
these quantities for a system not in equilibrium, i.e., one for which fp(y) is not Gaussian. It is clear that 
the kinetic temperature can be defined exactly as in ([4]) for any distribution. For a general non-equilibrium 
distribution, if we adopt the Shannon entropy of a distribution as the equivalent of Gibbs' entropy as is 
usually done, then there is no equation of state in general. Hence, it is not possible to directly define 
d fJff P ^ even though S(fp) and £(fp) are well defined as in (i| and (2). Particularly, note that, away from 
equilibrium S(fp) is not necessarily even a function of £(fp). 
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III. Perturbation Approach and Proposed Definition when Momentum is a Continuous 

Random Variable 

Since both S and £ are functions of fp, we can treat the distribution fp itself as a parameter (i.e., we 
use the distribution fp as descriptor of the macrostate of the system) and intuitively define thermodynamic 
temperature as 

dS(fp)/df p 
dS(fp)/dfp 

In other words, given a probability distribution fp, we perturb it to fp, Qand calculate the corresponding 
perturbations in S and £, namely AS" and AS. Then, we can define thermodynamic temperature as 

1 , AS 

- = Km — . (7) 
9 as^oAS 

At a minimum, the temperature obtained by perturbing fp should satisfy the following criteria 

1) 9 must be non-negative 

2) 9 should be equal to the thermodynamic temperature for the Gaussian distribution 

3) 9 should represent "spread" of the kinetic energy, i.e., the more spread out the kinetic energy, the 
higher temperature 

4) 9 should be a functional of the PDF fp 

Not all perturbations of the probability distribution would give rise to sensible definitions of temperature. 
In fact, it is quite possible to perturb the distribution in a way which will produce unconventional results 
such as temperature being negative. The following example illustrates this 

Example 1 For the sake of this example, consider a scalar random variable and for any two a, b, such 
that b > a > 0, let U[ a ^{y) be the uniform distribution between a and b, i.e., 

, x -r—, a < y < b; 

u [a , b] (y) = \ - . (8) 

I 0, otherwise. 

and let the probability distribution fp(y) be 

fp{v) = \u[-b,-a\(y) + 2 U m(v)- 



Consider a perturbation of fp(y) to fp>(y) given by 

1 
2 



fp'(y) = n^[_i_A,-a-A](3/) + U l±+A&+A](y)- 



It can be seen that 



S(f P ) = ln(6-a), (9) 
S(f P ,) = ln(— V (10) 



7 

1 h 3 — a 3 

1 (^ + A) 3 -(^ + A) 3 

£ <« - L 7 % 7 • (l2 > 

'Note that we use fp, to denote the density function of the perturbed distribution instead of fp. It must be understood that the random 
variable P is perturbed to obtain a new random variable P' whose PDF is fp,. 
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From (10), it can be seen that when j — 1, AS = S(fp,) — S(fp) = 0. However, for any A > 0, 
AS > and for this example, = 0. By choosing appropriate values for 7 and A, it is possible to get 
negative values of Thus demonstrating the fact that not perturbations are suitable for a meaningful 
definition of temperature. We will now introduce a specific form of perturbation for which the temperature 
defined in (Bl) will satisfy conditions 1-4 mentioned in Section III 



A. Additive Perturbation 

From the macroscopic perspective, one can view the definition of thermodynamic temperature in ([5]) as 
the mathematical embodiment of the following thought experiment. We increase the total kinetic energy 
of the particles by a small amount by "heating" the system. Then, we measure the change in entropy of 
the system. The ratio of the change in entropy to the change in energy is the inverse of the temperature. 
The key point to observe here is that, this thought experiment depends upon the notion of heating the 
system which guarantees both the entropy and energy increase (i.e., some sort of a diffusive process). We 
now propose a statistical realization of this notion. 

Motivated by the kinetic theory interpretation of heating as due to the random collision of particles 
with uncorrected momenta, we consider an additive perturbation of the following form. Let P be the 
random variable which represents the momentum and consider a new random variable P' given by 

P' = P + ^/SQ : (13) 

where Q is any random variable with zero mean and unit variance in each dimension and let the 
components of Q be independent of each other i.e., E[QQ T ] = l3Nx3N- Further, let Q be independent of 
P. Let the PDF of Q be and that of P' be fp,. Then, fp, is given by 

fp(M = f?W<afvulW> (14) 

where cg> represents convolution. Note that fp,(y, 5) is explicitly a function of 5 also. Since P and Q are 
independent, the following two relations hold: 

(i) S(fp,) > S(fp) 

This can be proved as follows. Since conditioning cannot increase entropy [2], 

S(fp.) > Sfatf) = S(f p+m ) = S(fp) (15) 

where fpn§ is the conditional distribution of P' given Q and S(fp,^) is the entropy of P' given Q. 
The last equality follows from the independence of P and Q. . 

(ii) £{f p ,) = £{f p ) + 3 £5. 

This follows from the fact that Q is a random variable with zero mean which is independent of P 

and, hence, E[\\P'\\ 2 } = E[\\P\\ 2 } + E[(V5Q) 2 ] = E[\\P\\ 2 } + §£<f. 
In other words, this method of perturbation is a way to increase the energy by an known amount 
which is also guaranteed to increase the entropy. Furthermore, as will be seen later, fp, (y, 5) satisfies the 
diffusion equation as 5 — > in 3iV dimensions. In other words, adding an independent random variable 
to the momentum is equivalent to heating the body by a specified amount and as is to be expected, it also 
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increases the entropy. Hence, one would expect that the ratio between the change in entropy associated 
with this perturbation to the change in energy associated with this perturbation would be a measure of 
inverse temperature of the distribution. 

a) Proposed Definition of Temperature: Hence, we formally define the inverse temperature of the 
system to be 



1 , S(f pi ) - S(f p ) 
e E(f pi ) - £(f p ) 



2m S(f P ,)- 
3N *™ S 



S(f f 



2m d 
3N86 



S(fp,)\s=o 



(16) 



We will now show that the above quantity is independent of the actual perturbation fg in (13) and, 
hence, depends only on the distribution fp, which is intuitively pleasing. We also show that 1/8 is the 
trace of the Fisher information matrix corresponding to the distribution of fp with respect to the location 
family, scaled by mk/3N. 



B. Relationship between Temperature and Fisher Information 

Fisher information is a quantity that is commonly used in parametric estimation [6]. For a scalar random 
variable P with a probability distribution fp(y), the Fisher information with respect to the location family, 
namely J(fp), is given by 



Ah 



fp(y) 



d_ 
dy 



In/p(y) 



dy 



fp(y) 



T y fp{y) 
fp(y) 



dy. 



(17) 



Now, let us consider two distributions fp(y + t) and fp(y — t) which are shifted versions of fp(y), shifted 
by t to the left and right, respectively. Then, Fisher information can be expressed as 

1 



J(f P ) = Jim - (D(f P (y + t)\\f P ) + D(f P (y - t)\\f P )) 



(18) 



where D(f P \\g P ) is the relative entropy (or Kullback-Leibler distance) between the distributions f P and 
gp. This can be shown easily using the result in [6], where it is shown for a family of distributions 
fp(y;u) parametrized by u, 



1 



J(v ) = lim ^D(f P (y;uj + t)\\f P (y;u )) 



Now, considering two parametric families f P (y; to) = f P (y — cu) and fp(y; uS) = f P (y + to) and applying 
this result and taking the average we get the desired result. Note that for both these families J(uq) = J(fp) 



as defined in ( [17] ), which gives us the LHS of ( fT8| ). 

For a vector valued random variable P with 3iV components, the i, jth entry of the Fisher information 
matrix is given by 



fp(y) 



1 SN 



(19) 



We now show that the temperature defined in (fT6]) is related to the trace of the Fisher information 



matrix defined in (19), i.e., Ji : i(fp). The key result that we use to establish this connection is the de 
Bruijn identity [2] which is given below for scalar random variables. 
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Lemma 3.1: (de Bruijn Identity) Let P be a scalar random variable with a finite variance and let fp 
be the PDF of P. Let Q be an independent random variable with unit variance and PDF fg, which is 
symmetric about 0, i.e., /q(?/) = fq{— y)- Let P' be the random variable given by 

P' = P + y/SQ. 



Then, Arbitrary perturbation: For any /q 

dS(f'p) 



05 



1 5=0 



'Ah 



(20) 



(21) 



Gaussian perturbation: In the special case of Q being a Gaussian random variable, the identity can be 

dS(f P ,] 



strengthened to 



k 



(22) 



05 2 

The proof for the de Bruijn identity for the Gaussian perturbation can be found in [2]. However, the 
de Bruijn identity for an arbitrary perturbation is more relevant to us and a proof for this does not appear 
to be available in the literature (although the result appears to be known [4]). So, we prove the identity 



in (21 ) here. Further, our proof also reveals some interesting characteristics of the perturbation which are 
discussed in Section IIII-Cl 

Proof: Let <j)p(s) = e sy f P (y)dy and 4>q(s) = e sy fq{y)dy be the moment generating functions 
(MGF) of the random variables P and Q (i.e., 4>p(s) and 4>q(s) are the Laplace transforms of fp(y) and 
fq{y), respectively). The MGF of the random variable V6Q is simply (J) ^gn(s) = (J)q(\/Ss). 



For P' given in (20), let (f)p>(s,8) be the MGF of P' (note that we explicitly express the MGF as a 
function of 5). Since P and V5Q are independent, 4>p>(s,5) is given by 



4>p'(s,6) = (f)p(s) 



^ 5sy f Q (y) dy 



(23) 



Using a series expansion for e^ sy , it can be seen that 



e^f Q ( V ) dy = f2^^, 



i=0 

where /xq(«) is the ith moment of Q. Since, we have assumed that /q(— y) = fq{y), it can be readily seen 
that all the odd moments are zero and further, since we have assumed Q has unit variance, /Uq(2) = 1. 
Therefore, 

^ y f Q {y)dy = i + d ^ + Y, J ^M 2k )- 

Substituting the above result into the right hand side of ( [23] ), we get 

2 + ^ (2k)\ 

Taking the inverse Laplace transform on both sides, we get 

Sdy P (y) 



<pp'(s,6) = (j)p(s) 



1 + 



5 h s 2k 



(24) 



fp'(y,8) = fp(y) 



2 dy 2 



v-« fe (0 ,,d 2k f P (y) 



(2k)V 



dy 



2 k 



(25) 



8 



Differentiating the above equation with respect to 5, we get 

dfp,(y,6) ld 2 f P (y) 



05 2 Oy 2 +0{6) - 



Hence, 



dfp.{y,8). l0 2 f P (y) l0 2 f P ,(y) 



05 lS ~° 2 Oy 2 2 dy 2 
One can now follow the same proof as in [2] to prove the identity. Basically, we consider the entropy 
of P' namely 

S(fp>) = -k fp,(y,5)\nf P ,(y,5) dy. (28) 

J — oo 

Differentiating the above equation with respect to 8, we get 



|(5=0- 



(26) 



(27) 



05 



05 



05 



-4 jy, M iy-k]y }P , M a J^ s i<iy. 



The first term can be seen to be zero since fp'(y, 5)dy = 1 and, hence, 



05 



dS(f F 
05 



1 5=0 



-oo 
oo 



05 



-k I lnf P ,{ y ,8) d J4^dy\^. 



05 



Substituting ( |27] ) in the above equation, we get 

0S(fp 



05 



I <5=0 



k r, , , , 2 fp(y) , 
in My) Q .. 2 % 



Now, integrating the RHS of the above equation by parts, we get 



05 



1 5=0 



k 
2 
k 
2 



-ln/p(y) 
-In My) 



gMy) 

gMy) 
9y 



df P (y) 



-oo + k J fp(y) V ^// 

+ fj(/p). 



The first term can be shown to be zero since it can be written as | 



8fp(v) 



(29) 
(30) 

(31) 

(32) 
(33) 

and 



9 Spiv) 
00 / dy 



dy = J(fp), which is bounded. The second term is zero at both 



8fp(y) 

, dy is bounded since f , , 

y = 00 and y = —00 since fp(y) — > as y — > 00 and 2; In 2; — > as z — ► 0. Hence, we get the desired 
result in the lemma. □ 

The relationship between thermodynamic temperature and Fisher information is given in the following 
theorem. 

Theorem 3.1: 

v (34) 



1 2mdS(f pi ) 



9 3N 05 



mk 
3N 
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Proof: Since the perturbation is independent in each dimension, we can apply Lemma 3.1 (de Bruijn 
identity) to each component of the vector valued random variable, which gives the desired result. □ 

Example 2: The difference between the kinetic temperature and the thermodynamic temperature is 
brought out in this example. Let the momentum be a scalar random variable P whose probability density 
function given by 

f P (y) = -^\^ (y -» ? + e-^^ 2 }, 

i.e., fp(y) is the mixture of two Gaussian distributions with variance a 2 and means [i and —fi. It is easy 
to see that the mean corresponding to fp(y) is zero and that the energy is S = fi 2 + a 2 . Hence, the kinetic 
temperature for a given /i and a 2 is 

T(^a 2 ) = \^ 2 + a 2 ). 
mk 

The thermodynamic temperature however is related to the Fisher information associated with the 
distribution and is given by 



mkJ(fp) 

which can be numerically evaluated for a given \i and a 2 . 

In Fig. [T| we plot the kinetic temperature and thermodynamic temperature for a 2 = 1 as is /i varied. 
As /i increases, the distribution varies from a single Gaussian to a bimodal distributed composed of two 
Gaussians separated by a distance of 2/i. It can be seen that the kinetic temperature increases monotonically 
with fj,, whereas the thermodynamic temperature does not. A qualitative explanation of this phenomenon 
is provided by the observation that the thermodynamic temperature reflects the average "local spread" of 
the distribution. Thus, when \x is close to zero, the thermodynamic temperature is close to the kinetic 
temperature. Whereas, when fi becomes large compared to a, the distribution has two distinct peaks and 
locally has the same spread as that of a Gaussian. Thus, one would expect the thermodynamic temperature 
to return to its original value as fi increases, even though the average kinetic energy (and, hence, the kinetic 
temperature) increases monotonically with [L. 



C. Remarks 

The above example illustrates the fact that unlike the equilibrium definition, the kinetic temperature and 
the thermodynamic temperature defined here show quite different behavior. The difference is especially 
pronounced for distributions such as for the bimodal distribution considered in the example. At this 
juncture, we note that a different notion of temperature has been introduced by Frieden [5], who defined a 
"Fisher Temperature" as the derivative of the Fisher information associated with the given distribution with 
respect to any observable. Such a definition will give different values of Fisher temperature depending on 
the observable used. On the other hand, the notion of thermodynamic temperature defined here is directly 
related to classical definitions, and uses the derivative of the classical entropy function with respect to 
energy. It is for this reason that we refer to the proposed definition as the "thermodynamic" temperature. 
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Fig. 1. Kinetic and Thermodynamic temperature for a mixture Gaussian distribution 



Equation ( |27j ) implies that regardless of the distribution of Q, the newly added momentum diffuses 
through the system in the limit as 5 — > 0. It is this diffusion of the added momentum that increases the 
entropy and the energy of the system as would be expected from any diffusion process. 

IV. Perturbation and Proposed Definition when Momentum is a Discrete Random 

Variable 

In this section, we consider the case when the momentum is a discrete random variable and for the sake 
of clarity, we will consider only the scalar case. We assume that the momentum P can take on values 
nh, for any integer n and a real constant h. Let fp denote the probability mass function of P and, for 
convenience let fp[n) = Pr(P = nh). There are two main reasons why the approach for the continuous 
case cannot be trivially extended to the discrete case. They are 



In the discrete case, one cannot define an additive perturbation such as in (20), since for arbitrary 



values of 5, the perturbed random variable P' will in general not be restricted to the set {nh} 



• The definition of Fisher information in ( 17 ) with respect to the location family requires probability 
density function f P to be smooth and, is hence, not directly applicable to discrete random variables. 

We now propose a perturbation of the discrete random variable P that results in the random variable 
P' which also takes on values in {nh}. Let Q be a discrete random variable also defined on {nh} and 
let }q\p\ = Pr(Q = nh). Further, let Q satisfy the following properties 

• Sq is symmetric, i.e., f Q [-n\ = f Q [n] 

• /q[1]^0 



p 



p f 




(n-l)h 

7qM 



nh 

/g,i[lN*f (n+ l)/i 



M [ i _j] = Pr( p' = i / i |p = i& ) 



Fig. 2. Equivalent communication channel between P and P' 



•Am 



(35) 



For any 5 > 0, let us define /Q,,s[n] as follows 

i-E:=i(/gH + /gh# R = i 

Now, consider a random variable P' whose PMF fp> [n] is 

fp> [n,S\ = f P [n] ® / Qi j [n] , (36) 

where ® refers to discrete convolution. The random variable P' can be thought as the output of a 
communication channel whose input is P and the transition probabilities in the communication channel 
are given by /Q j( j[n] as shown in Fig. [2j That is, Pr(P' = &7i|P = j/i) = /q^^ — j). 

Let H(fp) = —kJ2^=-oofp[ n \^ n fp[ n \ denote the entropy corresponding to the probability mass 
function fp (note that we use H instead of S, in accordance with standard notation in information theory 



for discrete random variables). The perturbation in (36) can be shown to have the following properties 

1) H(f P ,) > H(f P ) 

2) Etfp.) > €(f P ) 

The proof of Theorem 4.1 in the next section essentially proves these properties also. Since this is 
developed in more detail in the next section, the proof is omitted here. 
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We now formally define the temperature as 

1 H(f pi ) - H(f P ) 

a = i im eft \ at \ ■ ( 37 ) 

9 5-o £(f P ,) - £(f P ) 

We now show that this quantity can be expressed in terms of the relative entropies between fp[n], 
fp[n + 1] and f P [n — 1], where f P [n + 1] and fp[n — 1] refer to the distribution fp[n] shifted by one unit 
to the left and right, respectively. We will first show the following lemma 

Lemma 4.1: For any symmetric perturbation fq and any probability density function f P such that 

/ P [i]^0,Vi, 

fAn, 6] = f P [n] + 5f Q [l](f P [n - 1] + f P [n + 1] - 2) + o(5 2 ). 
Proof: A Taylor's series expansion of fp>{n, 5] with respect to 5 around 5 = gives 

/fK<5]=/pH+5 ^ | 5 =o + o(5 ). 

To prove the lemma we only need to show that d/>Ml j tf=Q = /g[l](/p[n — 1] + /p[n + 1] — 2). Let 
(j) P (s),(j)Q t s(s) and 4> P (s,8) be the moment generating functions corresponding to the distributions /p[n], 
f Qj5 [n] and /jw[n,(J]. Then, 

<MM) = MMqM = Ms) [J2 e ~ Snk f^ 

\ n 

From p5j ), we get 

oo oo 

= I>~* n %>] = 1-2/q[1]5-2^ / Q [i]^+(e- sft + e^) / Q [l]<5 ]T (e^ + e + "^) / Q [n]<f\ 

n i=2 n=2 

Hence, 

(oo \ oo 

1 - 2f Q [l]5 - 2 fQ\¥ +Ms) {e~ sh + e sh ) f Q [l]6M*) E + ^ ^N^- 

i=2 / n=2 

Grouping all the terms according to the exponents of 5, we get 



\s\n\ 



M{s,S) = Ms) + (e- sh + e sh -2)f Q [l}SMs) + o(S 2 ), (38) 
d(f)p'[n, 5} 
^ 05 

Now taking the inverse Laplace transform, we get the desired result 



|i=o = Ms){e- sh + e sh -2)f Q [l}. (39) 



^^%=o = f Q [l]{f P [n - 1] + /p[n + 1] - 2). (40) 

This results essentially means that, in the limit of 5 — ► 0, it suffices to consider perturbations for which 
only /q[-1],/q[0], and f Q [l] are non-zero. Since / Q [-l] = f Q [l] and / Q [-l] + / Q [0] + f Q [l] = 1, the 
perturbation is of the form /q[— 1] = 7, /q[0] = 1 — 27 an d /q[1] = 7- We could have obtained this 
result without the use of the moment generating function by directly considering the convolution of fp 
and fg s and then taking the limit of 5 — > 0. However, the use of the moment generating function makes 
the derivation for the discrete case similar to that of the continuous case in Section [III]. 
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We now show that our definition of temperature in (37) is closely related to the relative entropies 
between the fp[n] and its shifted versions. This is made precise in the following theorem 

Theorem 4.1: Consider a perturbation /q with fg[— 1] = /q[1] = 7 and /q[0] = 1 — 27. Then, the 
inverse of thermodynamic temperature is 

\ - s Wrm = ^ D(Mn + mM + Di,p[n ~ mM ' 

where fp[n — 1] and fp[n + 1] refer to the probability density function fp[n] shifted by one to the right 
and left, respectively. 

Proof: For the perturbation under consideration f P i is given by 

f P ,[n] = (1 - 2 1 )f P [n] + 7 (/ P [n - 1] + f P [n + 1]). 

Let us first consider the difference in the energy 

£(fp>) - B{fp) = ^ J2( nh fV p ' ~ M = h 2 Y,n 2 l(2fp[n] - f P [n - 1] - f P [n + 1]). 

n n 

Writing J2 n n2 fp[ n ~ 1] as ~~ -Q 2 + — 1)/p[ti — 1] and similarly, writing n 2 f P [n + 1] as 

+ !) 2 + 2n - l)/p[n + 1] and simplifying, we get 

7/1 2 



E{fp<)-E{fp) 



m 



Now, let us consider the term H(f P /) — H(f P ): 
H(f P ,)-H(f P ) = -k^2fp,[n]]nfp,[n] + k^2fp[n]]nfp[n\ 

n n 

= -k ((1 - 27)/pN + ky{f P [n -l] + f P [n+ 1])) In f P ,[n] + k ^ /pN In / P [n]. 

n n 

Since if(/ P ) = -kJ2 n f P [n+l]\nf P [n + l] = -kJ2 n f P [n - 1] ln/ P [n - 1], we can add 2ff(/ P ) + 
/p[ n + -*-] m fp[ n + 1] + fp[ n ~ 1] m fp[ n ~ 1] to tne r i§ nt hand side without affecting the result. 
Then, rearranging terms, we get 

H(f pi ) - H(f P ) = k(l - 2 1 )D{f P \\f pi ) + kj (D(f P [n + 1]||/p) + kD(f P [n - l]\\f P ,)) . (42) 

Writing a Taylor's series expansion for the first term D(f P \\f P /) with respect to 7 about 7 = 0, we get 

D(f P \\f p/ ) = D(fp\\f P ) + 1 dD ^ fp ' ) | 7=0 + o( 7 2 ). (43) 

Notice that £>(/p||/p) = and ^&Mpl1 |^ =q = q since for a given f P , D(f P \\f P r) is continuous and 
convex in f P i with a minimum at f P = f P i which occurs that 7 = [2]. Hence, -D(/p||/p') = o(7 2 ) and, 



using this in (42) and the fact that lim 7 ^ f P > = f P , we get 

lim o H f/ f P '\ " f(H = ^Difp[n + l]\\fp) + Difp[n-l]\\fp). (44) 



Note the similarity between this result and the relation in (18). In the limit as h — > 0, we recover the 



well known result in (18) for the continuous case. 
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V. Conclusion 

In this paper, we have demonstrated that the notion of thermodynamical temperature can be extended 
to non-equilibrium distributions in a relatively straightforward way for both the discrete and continuous 
cases. In each situation, we introduce a perturbation which is "diffusive". A key point to note is that 
this definition of thermodynamical temperature retains all the features of the classical thermodynamic 
temperature without the need for any hypothesis of equilibrium. In other words, it is completely general. 
Although the ideas have been developed for the case of a system where the distribution is defined on 
the momenta alone, it is a trivial matter to extend it to joint distribution functions. Specifically, we can 
define a temperature field by using the condition distribution of the momentum given the position of the 
particle, i.e., fp^. 
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